Active subclustering
Abstract
Although there are many excellent clustering algorithms, effective clustering remains very challenging for large datasets that contain many classes. Image clustering presents further problems because automatically computed image distances are often noisy. We address these challenges in two ways. First, we propose a new algorithm to cluster a subset of the images only (we call this subclustering), which will produce a few examples from each class. Subclustering will produce smaller but purer clusters. Then we make use of human input in an active subclustering algorithm to further improve results. We run experiments on a face image dataset and a leaf image dataset and show that our proposed algorithms perform better than baseline methods.
Similar resources
Semi-supervised and Active Image Clustering with Pairwise Constraints from Humans
Title of dissertation: Semi-supervised and Active Image Clustering with Pairwise Constraints from Humans Arijit Biswas, Doctor of Philosophy, 2014 Dissertation directed by: Prof. David W. Jacobs Department of Computer Science University of Maryland, College Park Clustering images has been an interesting problem for computer vision and machine learning researchers for many years. However as the ...
Star Formation in Clusters: Subclustering, Cloud Fragmentation and the Origin of the Stellar IMF Leonardo
We review recent high spatial resolution millimeter continuum and spectral line observations of (proto-)cluster regions. These observations reveal that the mass distribution of prestellar cores is consistent with the initial mass function for field stars suggesting that the IMF is connected to the molecular clouds structure or the cloud fragmentation processes, rather than the details of the st...
Human Bocavirus in Patients with Encephalitis, Sri Lanka, 2009–2010
We identified human bocavirus (HBoV) DNA by PCR in cerebrospinal fluid from adults and children with encephalitis in Sri Lanka. HBoV types 1, 2, and 3 were identified among these cases. Phylogenetic analysis of HBoV1 strain sequences found no subclustering with strains previously identified among encephalitis cases in Bangladesh.
The build-up of the Coma cluster by infalling substructures
We present a new multiwavelength analysis of the Coma cluster subclustering based on recent X-ray data and on a compilation of nearly 900 redshifts. We characterize subclustering using the Serna & Gerbal (1996) hierarchical method which makes use of galaxy positions, redshifts, and magnitudes, and identify 17 groups. One of these groups corresponds to the main cluster, one is the well known gro...
Sub-clustering in decomposable graphs and size-varying junction trees
Abstract: This paper proposes a novel representation of decomposable graphs based on semi-latent tree-dependent bipartite graphs. The novel representation has two main benefits. First, it enables a form of subclustering within maximal cliques of the graph, adding informational richness to the general use of decomposable graphs that could be harnessed in applications with behavioural type of dat...
Journal title:
- Computer Vision and Image Understanding
Volume 125
Publication date: 2014